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ABSTRACT 



A system and method for determining a root cause of error 
activity in a network is described herein. Root cause analysis 
includes the correlation between reported error activity for 
path, line and section entities along a provisioned channel in 
the network. Root causes can also be identified based upon 
the correlation of simultaneous error activity on various 
signal transport levels. Finally, root cause analysis can 
correlate error activity along vari ous pa tb^enjjlies. 

25 Claims 17 Drawing Sheets 
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SYSTEM AND METHOD FOR UNREPORTED 
ROOT CAUSE ANALYSIS 

CROSS-REFERENCE TO OTHER 
APPLICATIONS 

The following applications of common assignee contain 
some common disclosure: 

U.S. Patent Application entitled "System and Method for 
Identifying the Technique Used for Far-End Performance 
Monitoring of a DS1 at a Customer Service Unit," Ser. 
No. 08/671,028, filed Jun. 25, 1996. 

U.S. Patent Application entitled "System and Method for 
Formatting Performance Data In a Telecommunications 
System," Ser. No. 08/670,905, filed Jun. 26, 1996. 

U.S. Patent Application entitled "System and Method for 
Reported Root Cause Analysis," Ser. No. 08/670,844, 
filed Jun. 28, 1996. 

U.S. Patent Application entitled "Enhanced Correlated Prob- 
lem Alert Signals," Ser. No. 08/670,848, filed Jun. 28, 
1996. 

U.S. Patent Application entitled "Correlated Problem Alert 

Signals," Sen No. 08/673,271, filed Jun. 28, 1996. 
U.S. Patent Application entitled "Raw Performance Monitor 

Correlated Problem Alert Signals," Ser. No. 08/670,847, 

filed Jun. 28, 1996. 
U.S. Patent Application entitled "System and Method for 

Reported Trouble Isolation," Ser. No. 08/672,812, filed 

Jun. 28, 1996. 

U.S. Patent Application entitled "System and Method for 
Unreported Trouble Isolation," Ser. No. 08/672,513, filed 
Jun. 28, 1996. 

U.S. Patent Application entitled "System and Method for 

Monitoring Point Identification," Ser. No. 08/672,512, 

filed Jun. 28, 1996. 
U.S. Patent Application entitled "System and Method for 

End-to-End Threshold Setting," Ser. No. 08/670,845, filed 

Jun. 28, 1996. 

U.S. Patent Application entitled "System and Method for 
Monitoring Point Activation," Ser. No. 08/672356, filed 
Jun. 28, 1996. 

U.S. Patent Application entitled "System and Method for 
Tracking and Monitoring Network Elements " Ser. No. 
08/671,029, filed Jun. 25, 1996. 

The above-listed applications are incorporated herein by 
reference in their entireties. 

BACKGROUND OF THE INVENTION 

1. Field of the Invention 

The present invention relates generally to network man- 
agement systems, and more specifically is directed toward 
the determination of a root cause of error activity at one or 
more signal transport levels. 

2. Related Art 

The present application is a continuation of Application 
Ser. No. 08/668,516 filed Jun. 28, 1996 Entitled "System and 
Method for Unreported Root Cause Analysis". 

Telecommunication service providers (e.g., MCI Tele- 
communications Corporation) provide a wide range of ser- 
vices to their customers. These services range from the 
transport of a standard 64 kbit/s voice channel (i.e., DS0 
channel) to the transport of higher rate digital data services 
(e.g., video). Both voice channels and digital data services 
are transported over the network via a hierarchy of digital 
signal transport levels. For example, in a conventional 
digital signal hierarchy 24 DS0 channels are mapped into a 
DS1 channel. In turn, 28 DS1 channels are mapped into a 
DS3 channel. 
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Routing of these DS1 and DS3 channels within a node of 
the network is performed by digital cross-connect systems. 
Digital cross-connect systems typically switch the channels 
at the DS1 and DS3 signal levels. Transmission of channels 

5 between nodes is typically provided via fiber-optic trans- 
mission systems. Fiber-optic transmission systems can mul- 
tiplex a plurality of DS3 channels into a higher rate trans- 
mission over a single pair of fibers. In one example, signal 
formats for the fiber-optic transmission systems are defined 

to by the manufacturer. These proprietary systems are referred 
to as asynchronous transmission systems. 

Alternatively, a fiber-optic transmission system can 
implement the synchronous optical network (SONET) stan- 
dard. The SONET standard defines a synchronous transport 

15 signal (STS) frame structure that includes overhead bytes 
and a synchronous payload envelope (SPE), One or more 
channels (e.g., DS1 and DS3 channels) can be mapped into 
a SPE. For example, a single DS3 channel can be mapped 
into a STS-1 frame. Alternatively, 28 DS1 channels can be 

20 mapped into virtual tributaries (VTs) within the STS-1 
frame. 

Various STS-1 frames can be concatenated to produce 
higher rate SONET signals. For example, a STS-12 signal 
includes 12 STS-1 frames, while a STS-48 signal includes 

25 48 STS-1 frames. Finally, after an STS signal is converted 
from electrical to optical, it is known as an optical carrier 
(OQ signal (e.g., OC-12 and OC-48). 

An end-to-end path of a provisioned channel within a 
network typically traverses a plurality of nodes. This pro- 

30 visioned channel is carried over transmission facilities that 
operate at various rates in the digital signal hierarchy. For 
example, a provisioned DS1 channel may exist as part of a 
DS3, VT1.5, STS-1, STS-12, OC-12, and OC-4S signal 
along parts of the end-to-end path. This results due to the 

35 multiplexing and demultiplexing functions at each of the 
nodes. 

One of the goals of a network management system is to 
monitor the performance of the provisioned channel. Per- 

4Q formance of the provisioned channel can include various 
measures. One measure is the unavailability of the provi- 
sioned channel. Unavailability is generally defined as the 
amount (or fraction) of time that a channel is not operational. 
Various causes such as cable cuts can lead to channel 

45 downtime. Network responses to channel downtime can 
include automatic protection switching or various restora- 
tion procedures (e.g., digital cross-connect distributed 
restoration). 

Although unavailability is a major performance measure 
50 from a customer's standpoint, other performance measures 
can also be critical. For example, if a customer desires a 
digital data service for the transmission of financial data, the 
number of errored seconds or severely errored seconds may 
be a concern. 

55 In conventional network management systems, perfor- 
mance monitoring is accomplished in piecewise fashion. For 
example, consider a provisioned channel that traverses an 
end-to-end path comprising asynchronous transmission sys- 
tems and SONET transmission systems. Performance moni- 

60 taring information for these two types of transmission 
systems is typically maintained in separate databases. 
Moreover, the various types of transmission systems may be 
provided by multiple vendors. Each of these vendors may 
define their own separate performance monitoring process. 

65 For example, the vendor-controlled process may define the 
types of data that are retrieved from or reported by the 
individual network elements. 
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In this environment, comprehensive performance moni- FIG. 2 illustrates an exemplary circuit topology, 

toring analysis is difficult to accomplish. What is needed is FIGS. 3 and 7 illustrate flow charts of the correlation 

a network management system that can monitor provisioned process between path and line entities, 

channels at various points of the end-lo-end path and iden- fiGS. 4 ^ g illustrate flow charts of the correlation 

tify the root cause of problems that lead to observable error 5 process between line and section entities, 

activity. This capability allows a service provider to effi- piGS 5A _ 5D and 9A _ 9D iUusUate flow of the 

ciently resolve problems that lead to degradation of network between {cvc]s {n the signal transport 

performance. hierarchy. 

SUMMARY OF THE INVENTION to WGS. 6 and 10 illustrate flow charts of the correlation 

between path entities. 

The present invention satisfies the above mentioned needs mQ n iUustrates a block diagram of a computer useful 

by providing a comprehensive network management system for 1™^,,^ elements of the present invention, 
that can isolate a root cause of a problem in the network. In 

a first embodiment of the present invention, the root cause J5 DETAILED DESCRIPTION OF THE 

analysis operates on problem alert signals (PASs) generated PREFERRED EMBODIMENTS 

by monitoring points in the network. One example of a . . - . .j , 

p^blemalertsimal i s a threshold c rossing alert. A threshold The operation and administration of a service provider s 

craZTakH ^ a mgjuipje^^eFffl ance network is becoming increasingly complex. Network ele- 

P arl 55e7e1£e^^ 20 ™** °° ntinue f 10 evolve i? support of the provKior, 1 of a 

F - ; 77 t . t - Mt 2 0 wider range of services. The overriding goal of network 

In a second embodiment of the present myenuon he roo J aspects of the network are 
c,»* ? n^op«lM - m«^^r^vdy.S^J opera | ng according t0 ^ me service provider's design 
analysts can be used to identify faciuty jeratmg condiUons £ Ltomer's expectations, 
that could lead to a significant network problem. For *~ . 
example, the network facility could be operating at a point 25 A general open-ended framework isdcfined by the Inter- 
near the tolerance levels. Intermittent errors could therefore national Telecommun.cat.ons ^(ITU) Tele^mmumca- 
result due to temporary excursions beyond the tolerance Hons Management Network (TMN) standard. The TMN 
thresholds. If a statistical analysis identifies a potential standard defines a layered framework for a service provider 
problem, a raw performance monitoring PAS is generated. to implement its own network management process. 

In the present invention, root cause analysis seeks to 30 . FIG.-l-Ulusirates^.ne. ™*^^«^™ f*X 

identify sources of problems identified by a plurality of "eludes five layers 110, 120 130, 140 and 1150 Layer ISO 

PASs Li are visible to a layer in a network management is des.gnateda^ejwork element layer (NEL)^The NEL 

system. Analysis of the plurality of PASs include various is a phys.c^ayerth^cludes the va^us network ele- 

correlation processes. In one method, path PASs are corre- ™nts (e.gJfehronouj systems, SONET systems, etc.) 

lated to line PASs. In this manner, error activity identified on 35 «<*<> m thel^ort^fid routing of network traffic £g 

a provisioned channel can be isolated to a particular line DS1, DS3. OC-N , etc ). Each network element 151-156 in 

entity of the network. Further, in a second method of root NEL 150 can be des.gned to provide performance 

cause analysis, line PASs are correlated to section PASs. monitoring, alarm and status information to the higher layere 

This second correlation process allows error activity iden- in network management system 100 In particular, network 

lined on a line entity to be isolated to a particular section 40 ^"> ents U}-1S* are connected to one of the dement 

entity within the line entity. As par, of this" general correla- managers 141-143 m element management layer (EML) 

tion process, the root cause analysis correlates error activity 140. For example, network elemenU 151 and 152 are 

betwLn signal levels in the signal transport hierarchy. In connected .0 element manager 141. In th* manner, e cb 

this method of analysis, the highest transport level expert- network element manager 141-143 controls a portion of the 

encing error activity is identified. In this manner, the net- 45 fhysicai network embodied in NEL 150. 

work facilities that are the source of the problem can be Element managers 141-143 can retrieve information from 

identified network elements 151-156 periodically or upon a user 

An additional method of root cause analysis also corre- "I™*- Alternatively network elements 151-156 can 1 be 

lates error activity between path entities. Error activity on programmed to provide element managers 141-143 w,m a 

various path entiL may be caused by a common facility 50 predefined subset of network management .nformaUon at 

problem that is not detected by line entities within the path P«ede^ Ume interval Tte domain of an element man- 

entity. Identification of common line entities within the W "1-143 can be defined by a vendor s equ.pmc nL In 

various path entities eliminates redundant root cause analy- *>™ situations, me domain of an element manager 141-143 

F . « 1 ■ * 1 , t - t - is dictated by the geography in which network elements 

sis processing of the individual path entities. 7L« « • j 
. , . * - c 55 151-156 reside. 

The foregoing and other features and advantages of the man a g ement information is acquired by 

invention wil be apparent from the foUowmg more _par- * from 

ncular description of a preferred embodiment of the S management layer 

invent.on, as illustrated ui the accompanymg drawings. ^ ^ m oetwo± m 

BRIEF DESCRIPTION OF THE FIGURES 60 Network manager 131 is logically shown as a single entity. 

In implementation, network manager 131 can comprise one 

In the drawings, like reference numbers indicate identical or more sites. For example, multiple service centers (not 

or functionally similar elements. Additionally the left-most shown) can exist at different parts of the country (e.g., east 

digit of a reference number identifies the drawing in which coas t and west coast). In combination, these national-level 

the reference number first appears. $5 service centers combine to provide total visibility of the 

Fie. 1 illustrates the layers in a network management physical network in NEL 150. Network manager 131 can 

system. also be split among services and/or network elements. For 
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example, in one embodiment, a first network manager is the information stored in one or more layers of network 

dedicated to asynchronous parts of the network, a second management system 100. 

network manager is dedicated to DS1, DS3 and VT-n traffic, i n a first embodiment, root cause analysis is operative on 

and a third network manager is dedicated to STS-n and OC-n crr0 r activity that is reported from monitoring points that are 

traffic. 5 associated with network elements 151-156. Monitoring 

Generally, the logical entity identified as network man- points are described in greater detail in related applications 

ager 131 is a resource that is accessed by applications in entitled "System and Method for Monitoring Point 

service management layer (SML) 120. In FIG. 1, SML 120 Identification," Ser. No. 08/672,512, filed Jun. 28, 1996; 

is shown to inclge^veapplications 121-125. Specifically, "Systemaji ri M fi l hocLJ nr FnH-tn-Fnd Thrrshntri Smingr " 

SML 120 includes provisioning application 121, accounting/ 10 SerrNoT08/670,845, filed Jun. 28, 1996; and "System and 

billing application 122, security application 123, network Method for Monitoring Point Activation," Ser. No. 08/672, 

performance application 124, and fault management appli- 356, filed Jun. 28, 1996. 

cation 125. This listing of applications is provided without Generally, monitoring points are labeled as either primary 

limitation. Any other application that utilizes network man- monitoring points or secondary monitoring points. A basic 

agement data stored within NEL 150 can also be included. 15 monitoring point strategy for ag eneric end-to-en d_patb 

Note that elements of applications 121-125 also reside includes the pla^mennrfTnontto ririg pointy nearest! o the 

within EML 140 and NML 130. faciliryjSnaj^ 

Provisioning application 121 provides a customer inter- end points are designated as p rimary mo nitoring points 

face for the provisioning of various services. For example, (PMPs). A facility end point can be_thought_of as a generic 

a customer can indicate a desire for a DS1 digital data 20 customeMe^^ation poto 

service between network element 151 and network element local exchange carrier,-etc.). In the context of FIG. 2, PMPs 

155. Upon receipt of this customer request, provisioning can be associated with source 202 and destination 304. , 



application 121 relays the provisioning commands to net- In addition to the PMPs, a provisioned channel may also 
work manager 131. Network manager 131 then communi- include SMPs. SMPs are intermediate performance moni- 
cates with element managers 141, 143 and any other element 25 to ring data collection points. SMPs allow network manage - 
managers that control a part of the end-to-end path to set up me nt system 100 to isolate a network problem by providing 
the DS1 connection from network elements 151-155. performance monitoring information at intermediate sec- 
Applications 122^125 can similarly support a customer tions of the end-to-end path, 
interface by providing access to billing information, security 3Q Over a period of time, a monitoring point associated with 
information, performance information and fault manage- a network element may observe an excessive number of ESs 
ment information, respectively. Each of these applications on a received channel. If the monitoring point determines 
also access the resources that are stored within network that the error activity is significant, a problem alert signal 
manager 131 . (PAS) is reported to one of element managers 141-143. Note 
Finally, network management system 100 also includes 35 that any type of statistical analysis can be used by the 
business management layer (BML) 110. BML 100 includes monitoring point to determine whether to report a PAS to an 
logical entity 111. LogicaTentity 111 represents the general element manager 141-143. For example, the monitoring 
corporate policy of network management system 100. Cor- point could determine whether a number of ESs exceeds a 
porate policy 111 dictates the general business and contrac- predefined threshold. In the remainder of the description, 
tual arrangements of the service provider. 40 PASs are used to describe the general class of reported error 
Having identified the variqus_Jayers injietwork manage- ^jctivity.- 



ment system 100, a system and*TnetKocl for root cause Note that a monitoring point can generate a PASbased 
analysis is now described. Root cause analysis is generally upon" error activity measured at section, line, and path 
concerned with the identification of a source of a problem in terminating rwints^^f^^Jliustrates section, line and path 
the network. Problems in the network may or may not 45 entities in an exemplary channel (e.g., DS1, DS3, VT-n, 
involve actual system downtime. In other words, problems STS-1, STS-3c, STS-12c, etc.) provisioned between source 
in the network may manifest themselves as degradations in 202 and destination 204. In this example, the channel 
system performance. An example of performance degrada- originates at source 202, traverses multiplexer 210, regen- 
tion includes an increase in the bit error rate (BER). Bit erators 220 and 230, and cross-connect 240, and finally 
errors are typically measured in terms of errored seconds 50 terminates at destination 20<1 Each network element 202, 
(ESs) and severely errored seconds (SESs). An unacceptable 2Q4,_2 10, 220, 230 and 24 0 finserts and extracjg^ye rhead 
increase in the BER of the provisioned channel may prove information. In mejgj ggFcontext, section and line over- 
unsatisfactory to the customer. In many instances, customer tiead information is contained within the transport overhead 
expectations of performance of a provisioned channel are portion of a synchronous transport signal (STS) frame (not 
defined by the requirements contained within a service 55 shown). Path overhead information, on the other hand, is 
contract. The service contract may correlate system perfor- contained within the synch ronous payload envel ope (SPE) 
mance to the tarifiing structure. If the number of ESs, SESs information payload. Note 'mat the terms section, line and 
or unavailability of the service becomes excessive, customer path are used without limitation. As would be apparent to 
rebates may be in order. one of ordinary skill in the relevant art, the root cause 
In a competitive business climate, it is desirable for a 60 analysis described below could also be extended to other 
service provider to quickly identify and repair problems network transmission standards having analogous network 
leading to system downtime or degradation. If a root cause sectionalization. 

of a problem cannot be pinpointed, it will continue to affect In the following description, section, line and path entities 

system performance and a customers perceptions. The are used to refer to the portions of th enetwork that insert and 

present invention provides a system and method for identi- 65 exfa act sectio n, line and path overhead bytes, respectively, 

tying the root cause of network problems using a compre- For example, the OC-N link between regenerators 220 and 

hensive approach. This comprehensive approach analyzes 230 define a section entity. In the transmission from regen- 



04/16/2004, EAST Version: 1.4.1 



6,072,777 

7 8 

erator 220 to regenerator 230, regenerator 220^nserts)he sified into two types, line SESs (SESL) and path SESs 

section overhead bytes into the STS-N frame, performi an (SESP). If a PAS is generated based upon an excessive 

electrical-to-optical conversion of the STS-N signal, and number of SESLs, then the element manager knows that the 

transmits the OC-N signal to regenerator 230. Regenerator problem is local. Assuming that both ends of the line reside 

230 then receives the OC-N^igaal, performs an optical-to- 5 in the element manager's domain, the element manager can 

electrical conversion anc^extfactslne section overhead infor- conclude that the network problem can be isolated, 

mation. Regenerator 230 i =^firTuse the extracted section if a PAS is triggered based upon an excessive number of 

overhead information to determine the error performance of SESPs, then the element manager knows that the network 

the transmission from regenerator 220. For example, errors problem may not necessarily reside in the element manag- 

may have been created by a problem with the optical fiber, 10 er's domain. An additional determination must be made as 

a problem with a fiber-optic connector, etc. A monitoring to whether the originating and terminating network elements 

point associated with regenerator 230 tracks this error per- reside in the element manager's domain. If both reside in the 

formance over a monitoring period and reports the accumu- clement manager's domain, then the element manager 

lated results to an element manager 141-143. Note that knows that it can isolate the network problem, 

regenerators 220 and 230 extract only the section overhead 15 ^ WO uld be apparent to one of ordinary skill in the 

information. For this reason, the span between regenerators relevant art, implementation dependent decision trees can be 

220 and 230 define section entity 252 not line or path designed to determine whether a root cause of a network 

entities. problem can be isolated based upon a specific type of PAS. 

Next, consider the span between multiplexer 210 and If an element manager 141-143 determines that it does not 

cross-connect 240. In this span, multiplexer 210 and cross- 20 have enough information to identify the root cause of the 

connect 240 insert and e xtract bgj hjine and section over- network problem, then the root cause analysis must be 

head informa^onTNote-ThTt only^one^irection of thejwp- performed by the next highest layer in network management 

way communication is illustrated. As shown, the insertion system 100 (i.e., NML 130). This is illustrated by step 306. 

and*ext»clion-of-line-overriead information defines line Note that network manager 131 of NML 130 begins the 

entity 257. Line entity 257 includes section entities 25 isolation process as soon as a PAS is received from one of 

252-254. Finally, consider source 202 and destination 204. element managers 141-143. 

These elements insert^ajri extract path overhead Once control passes to NML 130, network manager 131 

information, mereby~3efining path entity 259. Path entity similarly determines whether it has enough information to 

^259 includes section entities_JJ51-255 and line entities isolate the problem. As noted above, in one embodiment 

256=258r — ' ~" ^ ^ 30 multiple network managers are used. In this case, root cause 

In considering the network as a whole, each channel analysis is performed by ao application in SML 120 using 
provisioned over a plurality of network elements defines its the resources contained in the multiple network managers, 
own section, line and path entities. Monitoring points asso- in the following description, assume that element man- 
dated with these network elements can monitor error activ- a g er 143 determines, at step 304, that it has enough infor- 
ity for section, line or path entities. Based upon the moni- mation to identify the root cause. Next, element manager 
tored error activity, a monitoring point can determine 143 determines, at step 308, whether a distinct line entity 
whether a PAS should be generated. This PAS is sent to an exists for the path entity. In other words, element manager 
element manager 141-143. The PAS can also be forwarded 143 determines whether the path entity includes more than 
to higher layers in network management system 100. ^ 0 ne line entity. If a path entity includes only one line entity, 

During a single monitoring period, monitoring points for then the line and path overhead information ^inserted at 

the various provisioned channels in the network can report source 202 is extracted af the same point. In the context of 

section, line and path PASs. Each layer in network manage- FIG. 2, this scenario would arise if multiplexer 210, regen- 

ment system 100 can analyze the reported PASs to determine erators 220 and 230, and cross-connect 240 did not exist, 

whether one or more root causes exist. The root cause 4J if element manager 143 determines, at step 308, that the 

analysis can simultaneously utilize one or more of the pam entity does not include distinct line entities, then the 

following methods of PAS analysis. path-line correlation process of FIG. 3 ends. In this case, the 

A first method of analysis is based upon the relationship root cause analysis cannot be further narrowed to a part of 

between path entities and fine entities. FIG. 3 illustrates a the path entity. However, if element manager 143 
flow chart of the path and line entity correlation process. The 50 determines, at step 308, that the path entity does include 

process begins at step 302 where a layer in network man- distinct line entities, then element manager 143 next 

agement system 100 receives a path PAS. Assume that a determines, at step 310, whether a line entity within the path 

monitoring point associated with destination 204 has entity has reported a corresponding PAS. 

reported a path PAS to element manager 143. Element if DO ne of the line entities within the path entity has 
manager 143 then determines, at step 304, whether it has 55 reported a corresponding PAS, then the path-line correlation 

enough information to identify the root cause of the network process ends. This results since the reported PASs within the 

problem. current monitoring period does not contain enough line 

Recall that each element manager 141-143 has access to entity information to further identify the root cause of the 
network management information received from a subset of path PAS. In one example, the root cause of the path PAS 
network elements 151-156. If the reported PAS was caused 60 may not have been identified by monitoring points associ- 
by a problem outside of an element manager's domain, then ated with the line entities. In another example, the monitor- 
that particular element manager cannot isolate the problem. ing points associated with the line entities may not have been 
Various sequences of decisions are used by each element activated. Subsequent activation of the monitoring points 
manager 141-143 to determine if the problem can be will allow further root cause analysis in a later monitoring 
isolated. 65 period. 

For example, consider the reported performance param- Finally, if element manager 143 determines, at step 310, 

eter of severely errored seconds (SESs). SESs can be clas- that a line entity has reported a corresponding PAS, then the 
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root cause of the network problem is narrowed to a particular 
line entity. Additional root cause analysis then proceeds at 
step 312 to further identify the root cause of the problem 
within the identified line entity. For example, the root cause 
could reside in a section entity within the line entity. Note 
that more than one line entity within a path entity can report 
a corresponding PAS. For example, errors within line enti- 
ties 256 and 257 of FIG. 2 could both be sufficient to trigger 
the reporting of a PAS. 

FIG. 4 illustrates a second method of root cause analysis 
that is based upon the correlation of PASs reported by line 
entities and section entities. The line-section correlation 
process begins at step 402 where a layer in network man- 
agement system 100 receives a line PAS. In the following 
discussion, assume that a monitoring point associated with 
cross-connect 240 has reported a line PAS to element 
manager 143. Element manager 143 then determines, at step 
404, whether it has enough information to identify the root 
cause of the network problem. For example, if a line PAS is 
generated by a monitoring point associated with cross- 
connect 240 of line entity 257, element manager 143 may 
determine that multiplexer 210 and regenerator 220 are in 
another element manager's domain. In this case, root cause 
analysis of the line PAS is performed at a higher layer in 
network management system 100 that has access to infor- 
mation for line entity 257. 

If element manager 143 determines, at step 404, that it has 
enough information to identify the root cause, then element 
manager 143 determines, at step 408, whether a distinct 
section entity exists for the line entity. In the example of line 
entity 257, distinct section entities 252-254 would exist. 

If element manager 143 determines, at step 408, that the 
line entity does not include distinct section entities, then the 
line-section correlation process of FIG. 4 ends. In this case, 
the root cause analysis cannot be further narrowed to a part 
of the line entity. However, if element manager 143 
determines, at step 408, that the line entity does include 
distinct section entities, then element manager 143 next 
determines, at step 410, whether a section entity within the 
line entity has reported a corresponding PAS. 

If none of the section entities within the line entity has 
reported a corresponding PAS, then the line-section corre- 
lation process ends. This results since the reported PASs 
within the current monitoring period does not contain 
enough section entity information to further identify the root 
cause of the line PAS. In a similar manner to the line -section 
correlation process described above, the path-line correla- 
tion process could result if the monitoring points associated 
with the section entities have not been activated. 

Finally, if element manager 143 determines, at step 410, 
that a section entity has reported a corresponding PAS, then 
the root cause of the network problem is narrowed to the 
particular section entity. Since the section entity is the lowest 
level of granularity, the root cause analysis is complete. 
Thus, at step 412, the section entity is identified as the root 
cause of the network problem. Again, note that more than 
one section entity within a line entity can report a PAS. 

As described above, a function of the root cause analysis 
is to correlate PASs. Path PASs can be correlated with line 
PASs and line PASs can be further correlated to section 
PASs. As part of this general correlation process a layer in 
network management system 100 correlates PASs between 
various signal levels in the transport hierarchy. Generally, 
when error activity is detected at a particular digital trans- 
port level, the correlation of that error activity to other error 
activity within the digital transport level hierarchy is not a 
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simple matter. For example, error activity detected at the 
DS3 level may not be detected at the STS-48 level. This may 
result due to the insufficient granularity of the error detection 
at the higher STS-48 level. An error at the STS-48 level may 

5 affect some or all of the lower transport levels within the 
STS-48 signal depending on the severity and the distribution 
of the error event. For example, bursty errors may affect only 
some of the lower transport levels while a continuous error 
may affect all lower transport levels. For this reason, per- 

10 formance monitoring data is collected at different hierarchial 
signal transport levels and error activity is correlated 
between the different signal transport levels. 

In this correlation process, the highest rate in the transport 
signal hierarchy experiencing simultaneous error activity is 

15 identified. FIGS. 5A-5D illustrate this correlation process of 
identifying the highest signal transport level that experiences 
simultaneous error activity. In the following description, 
assume that element manager 141 has received a DS1 path 
PAS from a network element in its domain. 

20 First, at step 502, element manager 141 determines 
whether the DS1 channel is mapped into a DS3 channel or 
into a VT-1.5 channel. If the DS1 channel is mapped into a 
DS3 channel, the process continues at step 506. At step 506, 
element manager 141 determines whether the DS3 in which 

25 the DS1 is mapped has reported a PAS. In other words, 
element manager 141 determines whether the DS3 in which 
the DS1 is mapped has simultaneous error activity occur- 
ring. 

In a similar manner to step 506, step 508 is invoked if 
30 element manager 141 determines, at step 502, that the DS1 
is mapped into a VT-1.5 channel. At step 308, element 
manager 141 determines whether the VT-1.5 in which the 
DS1 is mapped has reported a PAS. Again, element manager 
141 determines whether the VT-1.5 has simultaneous error 
35 activity occurring. 

If the determination at either step 504 or step 506 deter- 
mines that the VT-1.5 or DS3, respectively, has not reported 
a PAS, then element manager 141 reports the DS1 path PAS 
to network manager 131 in NML 130. Next, at step 510, the 
highest transport level is identified as a DS1 and the process 
ends. 

Returning to step 506, if the DS3 in which the DS1 is 
mapped does report a PAS, the process continues to step 512 

45 of FIG. 5B. At step 512, element manager 141 determines 
whether the DS3 is mapped into a STS-1. If the DS3 is not 
mapped into an STS-1, the DS3 PAS is reported to network 
manager 131 at step 516. Thereafter, the highest transport 
level is identified as a DS3. If element manager 141 

50 determines, at step 314, that the DS3 is mapped into an 
STS-1, element manager 141 then determines whether the 
STS-1 has reported a PAS. If a STS-1 PAS was not reported, 
the DS3 PAS is reported to network manager 131. 
Returning to step 504, if the VT-1.5 in which the DS1 is 

55 mapped does report a PAS, the process continues to step 520 
of FIG. 5C. At step 520, element manager 141 determines 
whether the STS-1 in which the VT-1.5 is mapped has 
reported a PAS. Here, no determination is made as to 
whether the VT-1.5 is mapped into a STS-1. This results 

60 because a VT-1.5 channel cannot exist in the network 
independently of a STS-1 channel. If the associated STS-1 
in which the VT-1.5 is mapped has not reported a PAS, the 
VT-1.5 PAS is reported to network manager 131 at step 522. 
Next, the highest transport level is identified as a VT-1.5 

65 channel at step 524. 

If element manager 141 determines at either step 514 or 
step 520 that a STS-1 has reported a PAS, the process 



04/16/2004, EAST Version: 1.4.1 



6,072,777 

11 12 

continues at step 526 of FIG. 5D. At step 526, element line PAS. At this point, the root cause analysis has narrowed 

manager 141 determines whether the STS-1 is mapped into the potential source of the error activity to line entity 257. 

an OC-48 system or an OC-3/12 system. An OC-48 system Further identification of the root cause of the problem is 

is generally used to transport STS-ls between two nodes in provided by an additional correlation between PASs for line 

the network. An OC-3 or OC-12 system, on the other hand, 5 entity 257 and PASs for section entities 252-254. 

is generally used to transport STS-ls between network Having described a correlation process between path, line 

elements within a particular node. For example, an OC-3 or anc j section PASs along the circuit topology of a single 

OC-12 fiber optic link could be used to transport STS-ls provisioned channel, a correlation process between indepen- 

between an OC-48 line terminating equipment (LTE) and a dent path PASs is now provided. As described above, each 

broadband digital cross-connect system (BBDCS). to pa th PAS that is received by a layer in network management 

If element manager 141 determines at step 526 that the system 100 could require independent root cause analysis 

STS-1 is mapped into an OC-3/12 system, clement manager consideration. Root cause analysis would generally seek to 

141 then determines, at step 328, whether the OC-3/12 narrow the consideration from a path level to a line or 

system has reported a PAS. If the OC-3/12 system has section level using the correlation processes outlined above^ 

reported a PAS, the highest transport level is identified as a 15 _ Note. howev er, that error activity detected for a path 

OC-3/12 at step 532. Conversely, if the OC-3/12 system has entity may not necessarily be identified at line or section 

not reported a PAS, a STS-1 PAS is reported to network entities. This results due to th e^error monitorin&^ rocess for 

manager 131 at step 534. Next, the highest transport level is ling and sec tion entities. Consider the span^defined by 

identified as a STS-1 at step 536. section entity 251 and line entity 256 of FIG. 2. In-this 

If element manager 141 determines at step 526 that the 20 example, the line and section overhead bytes are(ULgejlod ; 

STS-1 is mapped into an OC-48 system, element manager into the transport overhead part of a STS-1 framejt^ource 

141 then determines, at step 530, whether the OC-48 system 202. These line and section overhead bytes are^r^tetfby 

has reported a PAS. If the OC-48 system has reported a PAS, multiplexer 210. Included within ftejne and sEtian-over- 

the highest transport level is identified as a OC-48 at step head bytes is a bit interleaved fo^ ll Mljg) that 

538. Conversely, if the OC-48 system has not reported a 25 provides even jJantvNpver the previous S'rs-l Irame. The 

PAS, a STS-1 PAS is reported to network manager 131 at Ig^^s deternmeSby source 202 after the previous STS-1 

step 534. In this case, the highest transport level is identified frame is scrambled at source 202 prior to an electrical-to- 

as a STS-1 at step 536. optical conversion to the OC-1 signal. Upon receipt of the 

As the flow chart of FIGS. 5A-5D illustrate, this aspect of 30 ^SS> multiplexer 210 can determine whether a received 

root cause analysis seeks to identify the highest transport scrambled STS-1 contains any errors. r _ 

level that is experiencing simultaneous error activity. Note After the error calculation and subsequent unscrambling, 

that while all non-zero error activity is reported to the higher the STS-1 is provided as one of the low-speed inputs to 

layers in network management system 100, only the highest multiplexer 210. If multiplexer 210 is an OC-48 system, 48 

level path PAS is forwarded to NML 130. This reduces the 35 STS-ls are byte-interleaved multiplexed into a STS-48 

amount of trouble isolation processing that occurs at NML signal. After scrambling, the STS^8 signal is converted into 

130. Generally, the identification of the highest level path an OC-48 signal for transmission to regenerator 252. In this 

PAS allows network management system 100 to identify the process, note that errors can be generated internally by 

true source of the network error activity. Note also that the multiplexer 210. For example, a multiplexing card (not 

identification process of FIGS. 5A-5D need not begin at the 4Q shown) within multiplexer 210 could introduce errors into 

DS1 level. For example, the identification process could the SPE of one of the 48 STS-1 signals. Since these errors 

begin at step 512 after a DS3 PAS is reported. were introduced prior to the scrambling of the STS-48 

Having described the general correlation process between signal, the errors would not be detected by the line and 

signal transport levels, a simple example is provided with section entity error calculations. Tnese errors will be 

reference to FIG. 2. In this example, assume that a STS-1 45 detected by the path entity error detection process^ 

channel is provisioned between source 202 and destination Specifically, the PMP associated w,th destination 204 can 

204. Assume further that PMPs are associated with source ^viior the errors and generate a path PAS. 

202 and destination 204 and SMPs are associated with As noted above, multiplexer 210 can multiplex 48 STS-1 

multiplexer 210, cross-connect 240, and regenerators 220 s into the OC-48 signal. Each of these 48 STS-1 s could 

and 230. Both the PMPs and SMPs are activated. If errors 50 traverse independent paths throughout the network and 

are generated in any of network elements 202, 210, 220, 230, terminate on independent destinations. If multiplexer 210 

240 in the end-to-end path, a STS-1 path PAS may be generates errors on each of the 48 STS-ls, then potentially 

generated by a PMP associated with destination 204. After 48 separate STS-1 path PASs could be generated. Correla- 

receipt of a STS-1 path PAS, the correlation process con- tion between these path PASs could quickly identify multi- 

tinues at step 526 of FIG. 5D. At step 526, a layer in network 55 plexer 210 as the root cause of all the error activity, 

management system 100 determines whether the STS-1 is FIG. 6 illustrates the path PAS correlation process. This 

mapped into an OC-48 or OC-3/12 system. This determi- process begins at step 602 where a layer in network man- 

nation is based upon a retrieved topology of the provisioned agement system 100 receives a plurality of path PASs. Next, 

channel. In this example, the STS-1 is mapped into the a t step 604, the layer determines whether it has entire 

OC-N system that defines line entity 257. Assuming that the 60 information for all of the paths that have reported a PAS. If 

OC-N system is an OC-48 system, the correlation process the layer determines at step 606 that it does not have enough 

proceeds to step 530 where it is determined whether the information, the root cause analysis must be performed by 

OC-48 system has reported a PAS. one of the higher layers in network management system 100. 

The determination at step 530 can be answered in the This is illustrated by step 606. If the layer determines at step 

affirmative if the SMP that is associated with cross-connect 65 606 that it does have enough information, then the process 

240, reports a line PAS for the STS-48 signal. If this is the continues to step 608. At step 608, the layer determines 

case, the STS-1 path PAS is correlated to the OC-48/STS-48 whether a common line entity exists. If the layer determines 
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at step 608 that a common line entity does not exist then the 
process ends. Conversely, if the layer determines at step 608 
that a common line entity does exist, a possible root cause 
of the problem is identified at step 610. 

As described above, this possible root cause may rcpre- 5 
sent a convergence point of paths that generate path PASs. 
By identifying a convergence point, the layer bypasses 
redundant processing of the individual path PASs. Speed and 
efficiency of error detection and correction is thereby 
improved. 10 

In a second embodiment of the present invention, layers 
within network management system 100 use raw perfor- 
mance monitoring (PM) data to identify root causes of 
problems in the network. One goal of raw PM data analysis 
is to identify network problems at the earliest possible stage. 1 
In other words, it is desirable for a service provider to 
identify a potential network problem before any significant 
effects are felt by the customer. In this manner, the service 
provider is able to correct a problem before the customer is 
aware that a problem exists. 

One example of a potential problem is the existence of 
"dribble" errors. In this context, dribble errors are used to 
refer to a situation where a system is operating satisfactorily 
but not error-free. Errors that are reported by the monitoring 25 
points to the layers in network management system 100 are 
typically not large enough to cause a monitoring point to 
declare a PAS. Ordinarily, these non-zero error reports 
would not prompt any action by a service provider. 
However, these non-zero errors could indicate that a net- 3Q 
work element is operating at a point near the acceptable 
tolerance levels. Numerous examples exist. Intermittent 
errors could simply be caused by a dirty connector in a 
fiber-optic link. In other cases, synchronization shifts could 
cause jitter tolerance levels to be exceeded. In other 35 
examples, temperature or humidity variations could cause 
network element performance to periodically degrade. 

Regardless of the cause, intermittent non-zero error 
reports will be provided to the layers in network manage- 
ment system 100. Each layer in network management sys- 40 
tem 100 can independently analyze the existence of non- 
zero error activity over a period of time. Experience in the 
analysis of the non-zero error activity can lead to a corre- 
lation between specific patterns of error activity with the 
existence of specific network problems. Any means of 45 
statistical analysis can be used as a means for triggering a 
root cause analysis process. For example, if specific patterns 
of error activity are known to lead to certain failures, general 
pattern recognition systems (e.g., neural networks) can be 
used for triggering purposes. As noted above, this statistical 50 
analysis can be performed at each layer of network man- 
agement system 100 simultaneously. The only difference in 
processing is the scope of PM data that is available to an 
element in the particular layer in network management 
system 100. 55 

FIG. 7 illustrates a flow chart of the path and line entity 
correlation process in the second embodiment. The process 
begins at step 702 upon the generation of a raw PM path 
PAS. A raw PM path PAS is generated by one of the layers 
in network management system 100 upon an analysis of the 60 
reported raw PM path data. As noted above, analysis of 
patterns of error activity over time could cause a layer in 
network management system 100 to identify a potential 
network problem. Note that a raw PM path PAS is generated 
by a layer in network management system 100 while a 65 
regular path PAS is generated by a monitoring point asso- 
ciated with a network element. As further illustrated at step 
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702 of FIG. 7, a regular path PAS could also be used in the 
unreported root cause analysis. 

After a raw PM path PAS is generated or a regular path 
PAS is received by a layer in network management system, 
the layer determines whether it has enough information to 
determine the root cause. This process is represented by the 
loop of steps 704 and 706. After a layer in network man- 
agement system determines at step 706 that it has enough 
information to determine the root cause, the process contin- 
ues to step 708. 

If a layer determines, at step 708, that the path entity does 
not include distinct line entities, then the path-line correla- 
tion process of FIG. 7 ends. In this case, the unreported root 
cause analysis cannot be further narrowed to a part of the 
path entity. However, if the layer determines, at step 708, 
that the path entity does include distinct line entities, then 
the layer next determines, at step 710, whether a line entity 
within the path entity has reported non-zero error activity. 

If none of the line entities within the path entity has 
reported non-zero error activity, then the path-line correla- 
tion process ends. As noted above, this situation may result 
if the monitoring points associated with the line entities have 
not have been activated. Subsequent activation of the moni- 
toring points will allow further unreported root cause analy- 
sis in a later monitoring period. 

Finally, if the layer determines, at step 710, that a line 
entity has reported non-zero error activity, then the root 
cause of the network problem is narrowed to a particular line 
entity. Additional root cause analysis then proceeds at step 
712 to further identify the root cause of the problem within 
the identified line entity. 

FIG. 8 illustrates a second method of unreported root 
cause analysis that is based upon the correlation of error 
activity in line entities and section entities. The line -section 
correlation process begins at step 802 where a layer in 
network management system 100 generates a raw PM line 
PAS or receives a regular PAS from a monitoring point. 
After a raw PM line PAS is generated or a regular path PAS 
is received by a layer in network management system, the 
layer determines whether it has enough information to 
determine the root cause. This process is represented by the 
loop of steps 804 and 806. After a layer in network man- 
agement system determines at step 806 that it has enough 
information to determine the root cause, the process contin- 
ues to step 808. 

At step 808, the layer determines whether a distinct 
section entity exists for the line entity. If element manager 
143 determines, at step 808, that the line entity does not 
include distinct section entities, then the line-section corre- 
lation process of FIG. 8 ends. In this case, the unreported 
root cause analysis cannot be further narrowed to a part of 
the line entity. However, if the layer determines, at step 808, 
that the line entity does include distinct section entities, then 
the layer next determines, at step 810, whether a section 
entity within the line entity has reported non-zero error 
activity. 

If none of the section entities within the line entity has 
reported non-zero error activity, then the line-section corre- 
lation process ends. Finally, if the layer determines, at step 
810, that a section entity has reported non-zero error activity, 
then the root cause of the network problem is narrowed to 
the particular section entity. Since the section entity is the 
lowest level of granularity, the unreported root cause analy- 
sis is complete. Thus, at step 812, the section entity is 
identified as the root cause of the network problem. 

As described above, a function of the unreported root 
cause analysis is to correlate non-zero error activity. Raw 



04/16/2004, EAST Version: 1.4.1 



6,072,777 

15 16 

PM path PASs can be correlated with non-zero zero error error activity, a STS-1 raw PM PAS is reported to the next 

activity on a line entity and raw PM line PASs can be further highest layer at step 934. The highest transport level is then 

correlated to non-zero error activity on a section entity. In a identified as a STS-1 at step 936. 

similar manner to the reported root cause analysis, the If the layer determines at step 926 that the STS-1 is 

unreported root cause analysis correlates non-zero error 5 mapped into an OC-48 system, the layer then determines, at 

activity between various signal levels in the transport hier- step 930, whether the OC-48 system has reported non-zero 

archy. In this correlation process, the highest rate in the error activity. If the OC-48 system has reported non-zero 

transport signal hierarchy experiencing simultaneous error error activity, the highest transport level is identified as a 

activity is identified. FIGS. 9A-9D illustrate this correlation OC-48 at step 938. Conversely, if the OC-48 system has not 

process. In the following description, assume that a layer in 10 reported non-zero error activity, a STS-1 raw PM PAS is 

network management system 100 has either generated a raw reported to the next highest layer at step 934. In this case, the 

PM DS1 path PAS or received a regular DS1 PAS from a highest transport level is identified as a STS-1 at step 936. 

monitoring point. As the flow chart of FIGS. 9A-9D illustrate, the present 

The transport level determination process begins at step invention seeks to identify the highest transport level that is 

902 of FIG. 9 A. At step 902, the layer determines whether 15 experiencing simultaneous error activity. Note again that the 

the DS1 channel is mapped into a DS3 channel or into a identification process of FIGS. 9A-19D need not begin at 

VT-1.5 channel. If the DS1 channel is mapped into a DS3 the DS1 level. For example, the statistical analysis could 

channel, the process continues at step 906. At step 906, the identify a potential problem at the DS3 level. In this case, the 

layer determines whether the DS3 in which the DS1 is process described in FIGS. 9A-9D begins at step 912. 

mapped has reported non-zero error activity. In a similar 20 pic. 10 illustrates the process of correlating path PASs. 

manner to step 906, step 908 is invoked if the layer jlm process begins at step 1002 where a layer in network 

determines, at step 902, that the DS1 is mapped into a management system 100 identifies a plurality of path PASs. 

VT-1.5 channel. At step 908, the layer determines whether These path PASs include both raw PM path PASs and 

the VT-1.5 in which the DS1 is mapped has reported regular PASs. The raw PM path PASs may be generated by 

non-zero error activity. In the simplest example for either 25 anv 0 f me layers within network management system 100. 

scenario, the layer determines whether non-zero error activ- tne pi urant y 0 f p aln pass are identified, the layer 

ity has been recorded for the same monitored parameter determines at step 1004 whether it has entire information for 

identified in the original statistical analysis. all of me palns mat have reported a PAS. If the layer 

If the determination at either step 904 or step 906 deter- ^ determines at step 1004 that it does not have enough 

mines that the VT-1.5 or DS3, respectively, has not reported information, the root cause analysis must be performed by 

non-zero error activity, then the layer, at step 908, reports the 0 ne of the higher layers in network management system 100. 

DS1 raw PM PAS to the next highest layer in network This is illustrated by step 1008. If the layer determines at 

management system 100. Next, at step 910, the highest s t e p 1006 that it does have enough information, then the 

transport level is identified as a DS1 and the process ends. process continues to step 1008. At step 1008, the layer 

Returning to step 906, if the DS3 in which the DS1 is determines whether a common line entity exists. If the layer 

mapped does report non-zero error activity, the process determines at step 1008 that a common line entity does not 

continues to step 912 of FIG. 9B. At step 912, the layer exist then the process ends. Conversely, if the layer deter- 

determines whether the DS3 is mapped into a STS-1. If the mines at step 1008 that a common line entity does exist, a 

DS3 is not mapped into an STS-1, the DS3 raw PM PAS is ^ possible root cause of the problem is identified at step 1010. 

reported to the next highest layer at step 916. Thereafter, the In a similar manner to the reported root cause analysis, the 

highest transport level is identified as a DS3 at step 918. If correlation of path PASs allows a layer to bypass redundant 

the layer determines, at step 914, that the DS3 is mapped processing of the path PASs individually, 

into an STS-1, the layer then determines whether the STS-1 In one embodiment, the invention is directed to a com- 

has reported non-zero error activity at step 914. If non-zero 45 puter system operating as discussed herein. For example, 

error activity was not reported, the DS3 raw PM PAS is functions in each layer of network management system 100 

reported to the next highest layer. is implemented using computer systems. An exemplary 

Returning to step 904, if the VT-1.5 in which the DS1 is computer system 1102 is shown in FIG. 11. The computer 

mapped does report non-zero error activity, the process system 1102 includes one or more processors, such as 

continues to step 920 of FIG. 9C. At step 920, the layer 50 processor 1104. The processor 1104 is connected to a 

determines whether the STS-1 in which the VT-1.5 is communication bus 1106. 

mapped has reported non-zero error activity. If the associ- The computer system 1102 also includes a main memory 

ated STS-1 in which the VT-1.5 is mapped has not reported 1108, preferably random access memory (RAM), and a 

non-zero error activity, the VT-1.5 raw PM PAS is reported secondary memory 1110. The secondary memory 1110 

to the next highest layer at step 922. Next, the highest 55 includes, for example, a hard disk drive 1112 and/or a 

transport level is identified as a VT-1.5 channel at step 924. removable storage drive 1114, representing a floppy disk 

If the layer determines at either step 914 or step 920 that drive, a magnetic tape drive a compact disk drive, etc. The 

a STS-1 has reported non-zero error activity, the process removable storage drive 1114 reads from and/or writes to a 

continues at step 926 of FIG. 9D. At step 926, the layer removable storage unit 1116 in a well known manner, 

determines whether the STS-1 is mapped into an OC-48 60 Removable storage unit 1116, also called a program 

system or an OC-3/1 2 system. If the layer determines at step storage device or a computer program product, represents a 

926 that the STS-1 is mapped into an OC-3/12 system, the floppy disk, magnetic tape, compact disk, etc. As will be 

layer then determines, at step 928, whether the OC-3/12 appreciated, the removable storage unit 1116 includes a 

system has reported non-zero error activity. If the OC-3/12 computer usable storage medium having stored therein 

system has reported non-zero error activity, the highest 65 computer software and/or data. 

transport level is identified as a OC-3/12 at step 932. Computer programs (also called computer control logic) 

Conversely, if the OC-3/12 system has not reported non-zero are stored in main memory and/or the secondary memory 
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1110. Such computer programs, when executed, enable the 
computer system 1102 to perform the features of the present 
invention as discussed herein. In particular, the computer 
programs, when executed, enable the processor 1104 to 
perform the features of the present invention. Accordingly, 
such computer programs represent controllers of the com- 
puter system 1102. 

In another embodiment, the invention is directed to a 
computer program product comprising a computer readable 
medium having control logic (computer software) stored 
therein. The control logic, when executed by the processor 
1104, causes the processor 1104 to perform the functions of 
the invention as described herein. 

In another embodiment, the invention is implemented 
primarily in hardware using, for example, a hardware state 
machine. Implementation of the hardware state machine so 
as to perform the functions described herein will be apparent 
to persons skilled in the relevant art(s). 

While the invention has been particularly shown and 
described with reference to preferred embodiments thereof, 
it will be understood by those skilled in the relevant art that 
various changes in form and details may be made therein 
without departing from the spirit and scope of the invention. 

What is claimed is: 

1. A method, in a layer of a network, management system, 
for identifying a root cause of a problem in a provisioned 
channel in a network, the provisioned channel being routed 
through a plurality of network elements, the method com- 
prising the steps of: 

(1) receiving performance monitoring data gathered for 
section, line and path entities by a plurality of moni- 
toring points during, a monitoring period, each of said 
plurality of monitoring points being associated with a 
network element; and 

(2) identifying a correlation between any error activity in 
said section, line and path entities as indicated by said 
performance monitoring data that is received from at 
least two different ones of said plurality of monitoring 
points to locate a root cause of a problem in either said 
section, line or path entities of the provisioned channel. 

2. The method of claim 1, wherein said step (2) comprises 
the steps of: 

(a) analyzing performance monitoring data at a first signal 
transport level reported from a first monitoring point to 
determine whether a potential problem exists; 

(b) identifying a second signal transport level that expe- 
riences corresponding error activity, said error activity 
being reported by a second monitoring point, wherein 
said first signal transport level is mapped into said 
second signal transport level; and 

(c) identifying a third monitoring point upstream of said 
second monitoring point that reports corresponding 
error activity. 

3. The method of claim 2, wherein said step (b) is repeated 
until a highest transport level is identified. 

4. The method of claim 2, further comprising the step of 
discontinuing further processing for problem alert signals 
that are received by the layer in the network management 
system from monitoring points downstream from said first 
monitoring point, wherein said problem alert signals corre- 
spond to the error activity reported by said first monitoring 
point. 

5. The method of claim 1, wherein said step (2) comprises 
the steps of: 

(a) analyzing performance monitoring data reported by a 
monitoring point associated with a line entity to deter- 
mine whether a potential problem exists; 



10 



20 



25 



30 



35 



45 



50 



55 



60 



65 



(b) determining whether said line entity includes a plu- 
rality of section entities; 

(c) determining whether a monitoring point associated 
with a section entity within said line entity has reported 
corresponding error activity; and 

(d) initiating a problem handling process for said section 
entity, if a monitoring point associated with a section 
entity has reported corresponding error activity. 

6. The method of claim 5, further comprising the step of: 

(e) initiating a problem handling process for said line 
entity, if a monitoring point associated with a section 
entity has not reported corresponding error activity. 

7. The method of claim 5, further comprising a step before 
said step (b) of determining the topology of the line entity. 

8. The method of claim 1, wherein said step (2) comprises 
the steps of: 

(a) analyzing performance monitoring data reported by a 
monitoring point associated with a path entity to deter- 
mine whether a potential problem exists; 

(b) determining whether said path entity includes a plu- 
rality of line entities; 

(c) determining whether a monitoring point associated 
with a line entity within said path entity has reported 
corresponding error activity; and 

(d) initiating a problem handling process for said line 
entity, if a monitoring point associated with a line entity 
has reported corresponding error activity. 

9. The method of claim 8, further comprising the step of: 

(e) initiating a problem handling process for said path 
entity, if a monitoring point associated with a line entity 
has not reported corresponding error activity. 

10. The method of claim 9, further comprising a step 
before said step (b) of determining the topology of the path 
entity. 

11. The method of claim 1, wherein said step (2) com- 
prises the steps of: 

(a) analyzing performance monitoring data reported by 
monitoring points associated with a first and a second 
path entity to determine whether a potential problem 
exists; 

(b) determining whether said first and second path entities 
include a common line entity; and 

(c) initiating a problem handling process for said common 
line entity, if a common line entity exists. 

12. The method of claim 11, further comprising a step 
before said step (b) of determining the topology of said first 
and second path entities. 

13. A system for identifying a root cause of a problem in 
a provisioned channel in a network, the provisioned channel 
being routed through a plurality of network elements, com- 
prising: 

(1) means for receiving performance monitoring data 
gathered for section, line and path entities from a 
plurality of monitoring points during a monitoring 
period, each of said plurality of monitoring points 
being associated with a network element; and 

(2) means for identifying a correlation between any error 
activity in said section, line and path entities as indi- 
cated by said performance monitoring data that is 
received from at least two different ones of said plu- 
rality of monitoring points to identify a root cause of a 
problem in either said section, line or path entities of 
the provisioned channel. 

14. The system of claim 13, wherein said means for 
identifying comprises: 
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means for analyzing performance monitoring da la at a 
first signal transport level reported from a first moni- 
toring point to determine whether a potential problem 
exists; 

means for identifying a second signal transport level that 5 
experiences corresponding error activity, said error 
activity being reported by a second monitoring point, 
wherein said first signal transport level is mapped into 
said second signal transport level; and 

means for identifying a third monitoring point upstream 
of said second monitoring point that reports corre- 
sponding error activity. 

15. The system of claim 14, further comprising means for 
identifying a highest signal transport level that experiences 
corresponding error activity. 

16. The system of claim 14, further comprising means for 
discontinuing further processing for problem alert signals 
that are received by the layer in the network management 
system from monitoring points downstream from said first ^ 
monitoring point, wherein said problem alert signals corre- 
spond to the error activity reported by said first monitoring 
point. 

17. The system of claim 13, wherein said means for 
identifying comprises: 25 

means for analyzing performance monitoring data 
reported by a monitoring point associated with a line 
entity to determine whether a potential problem exists; 

means for determining whether said line entity includes a 
plurality of section entities; 30 

means for determining whether a monitoring point asso- 
ciated with a section entity within said line entity has 
reported corresponding error activity; and 

means for initiating a problem handling process for said 
section entity, if a monitoring point associated with a 35 
section entity has reported corresponding error activity. 

18. The system of claim 17, further comprising: 
means for initiating a problem handling process for said 

fine entity, if a monitoring point associated with a 
section entity has not reported corresponding error 
activity. 

19. The system of claim 17, further comprising means for 
determining the topology of the line entity. 

20. The system of claim 13, wherein said means for 
identifying comprises: 

means for analyzing performance monitoring data 
reported by a monitoring point associated with a path 
entity to determine whether a potential problem exists; 

means for determining whet her said path entity includes 
a plurality of line entities; 
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means for determining whether a monitoring point asso- 
ciated with a line entity within said path entity has 
reported corresponding error activity; and 

means for initiating a problem handling process for said 
line entity, if a monitoring point associated with a line 
entity has reported corresponding error activity. 

21. The system of claim 20, further comprising: 
means for initiating a problem handling process for said 

path entity, if a monitoring point associated with a line 
entity has not reported corresponding error activity. 

22. The system of claim 21, further comprising means for 
determining the topology of the path entity. 

23. The system of claim 13, wherein said means for 
identifying comprises: 

means for analyzing performance monitoring data 
reported by monitoring points associated with a first 
and a second path entity to determine whether a poten- 
tial problem exists; 

means for determining whether said first and second path 
entities include a common line entity; and 

means for initiating a problem handling process for said 
common line entity, if a common line entity exists. 

24. The system of claim 23, further comprising means for 
determining the topology of said first and second path 
entities. 

25. A computer program product, comprising: 

a computer usable system medium having computer read- 
able program code means embodied in said medium 
that identifies a root cause of a problem in a provisioned 
channel in a network, the provisioned channel being 
routed through a plurality of network elements, said 
computer readable program code means comprising: 

first computer readable program code means for causing 
a computer to effect a reception of performance moni- 
toring data gathered for section, line and path entities 
from a plurality of monitoring points during a moni- 
toring period, each of said plurality of monitoring 
points being associated with a network element; and 

second computer readable program code means for caus- 
ing a computer to effect an identification of a correla- 
tion between any error activity in said section, line and 
path entities as indicated by said performance moni- 
toring data that is received from at least two different 
ones of said plurality of monitoring points to identify a 
root cause of a problem in either said section, line or 
path entities of the provisioned channel. 
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